CDS
Accession Number | TCMCG075C25669 |
gbkey | CDS |
Protein Id | XP_007012469.2 |
Location | join(3314750..3315421,3315537..3315734,3316289..3316693,3317007..3317070,3318061..3318278,3318375..3318488,3318899..3319004,3319239..3319450,3320089..3320192,3320583..3320727) |
Gene | LOC18588176 |
GeneID | 18588176 |
Organism | Theobroma cacao |
Protein
Length | 745aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_007012407.2 |
Definition | PREDICTED: DNA cross-link repair protein SNM1 [Theobroma cacao] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGCTCTCGAGATCTTCCGCATCCCAGTTTCTTTCCACCGACGACGACGACGACGATTTTCAAGTTCCCCCAACTCAAACTCTTTCAGCTTCCATTAAACCCACCTCACACAAGAACCCCCTCAAGCCCTCCAATACTCCCCGCCCTCCTTCCAAGAAACCTAAACGCCCTGATAACCCACCCGGAAAAGAAAACGCTGCCGTTATCGCCATTCCGATAACCCGATCCAATGATCAGCCCGATCTCGATGAAACCTGCAGCTTGGATTTAATACCGTCCAGCATCAATTGTAGTTTTAATTTGACTTCAGCCCAAGATAGGGAGTCTGATTATGTAAAATGTGACGAAAAGAAAAAGGAGTTATTGGAATTGAATAAGGGTTACTTGTGTAATTCAGTAGAGTCGAGATTAATAAGGCCGAGATCAGAGTTAAGCGAGGAGTTCGGAGAAGATTTTGACGAAGATAATGAGCTTGATGCCTTACTTAAGCTATGCAACGACGTAGAAGAAGAAAAAGAAGAAGACAGTGGAGATGAAAAGGAAAGTAATGTTCTAGACAATAGTCTAGTTCAATGTCCTCTTTGTGGAGTTAATATTTCGGGTTTGAATGAAGAGCACCGACTGGTTCACATCAATGATTGTCTCGACAAAGTGGAGAATCCTGGTCAAAATGTTGTTTTTCCTCCTAGTGTTGACAGGGAATTTCAGTGCGTTCCTGAGGTTGTTGATGGTCCCCCTTTGTCTCCTCGACAAGTTGTTGATGTCTCCCCTGTTGTTAAATGGCTAAGTAATCTTGGTTTAGCAAGATATGCTGATGCTTTTGTCCGAGAAGAGGTTGATTGGGACACTCTGAAGTGGTTGACTGAAGAGGATTTGTTCAGCATTGGTGTTACTGCACTAGGCCCCAGGAAGAAGATTGTGCATGCTCTTAGTGAACTCAGAAAAAGCTACTCCTGTGCAGCTGAGAGGCACATGGGTCATCCCAGTCATGGAAATGGATCAGCCAAAAGCAGCAGAGCAAAGACGCAAACTGAGATTTCTAATTTTATAGATGACGAAACTACTAAGCCAGCTGCAAACAAGTTAATTACAGATTTTTTTCCTGGCTTGGTTTCTGACAGGAAGAAAGTTTGCACCCCTCCAAGAGGACAGCATATATCAAGCAAAAGTCACTCAGATCCTGGTCGTAGACGTGTGCAGACAAATCATGTTAAAAATGGAAAACTAAAAGATATTCCTGCATGGTGTTGCATTCCAGGAACACCATTTCGAGTGGATGCTTTCAAATATCTTCGAGGAGATTGTTCCCACTGGTTTCTCACACACTTCCATATGGACCATTATCAAGGATTAACAAGGTCTTTTCGTCATGGTAAGATTTACTGCTCCTCAATCACAGCACAGCTTGTAAATGTAAAGCTTGGAATACCATGGGAAAAGTTGCAAGTTTTACCCCTCAACCAAAAGATCAATATTGCTGGTATTGAGATAACATGCTTGGATGCAAATCACTGCCCAGGATCCATCATGATACTCTTTGTACCACCAAATGGTAAGGCTGTTCTACACACAGGAGATTTTCGCTTTTGTGAGGAAATGGCAAGCATGTCTCTTTGGCATGCTTGTCCTATACATACTCTCATCCTTGATACAACTTACTGTAATCCTCAGTATGACTTCCCAAAGCAGGAGGCTGTAATACAGTTTGTCATTGAGGCAATCCAAGCAGAGGCTTTCAACCCTAAGACACTTTTTCTGATTGGCAGCTACACAATTGGAAAGGAAAGGCTTTTCTTGGAGGTTGCTCGTGTCCTTCGTAGAAAGGTTTACATCACTGCAGCAAAGTTCCGTCTTTTGGATTGCTTGGGTTTCTCTGAGGAAGATATGCGGTGGTTCACACTTAATGAACAGGAAAGCCAGATCCATGTTGTCCCTATGTGGACACTTGCAAGCTTCAAACGATTGAAACACATATCTAACCAATATGCGGGTCGATTCAGTCTAATAGTTGCTTTCTCTCCTACGGGTTGGGCACTTGGTAAGGGGAAGAAAAAGGCTCCAGGGAGAAGGTGGCAGCAGGGTACAATCATCAGGTACGAAGTGCCATATAGTGAGCATTGCAGCTTTACAGAGCTCAAAGAATTTGTGAAAATTTTATCTCCCGAAAACATAATACCAAGTGTGAATAATGATGGACCAGATTCTACCAAAGCCATGATTTCCCTCCTGTTGCCTTGA |
Protein: MLSRSSASQFLSTDDDDDDFQVPPTQTLSASIKPTSHKNPLKPSNTPRPPSKKPKRPDNPPGKENAAVIAIPITRSNDQPDLDETCSLDLIPSSINCSFNLTSAQDRESDYVKCDEKKKELLELNKGYLCNSVESRLIRPRSELSEEFGEDFDEDNELDALLKLCNDVEEEKEEDSGDEKESNVLDNSLVQCPLCGVNISGLNEEHRLVHINDCLDKVENPGQNVVFPPSVDREFQCVPEVVDGPPLSPRQVVDVSPVVKWLSNLGLARYADAFVREEVDWDTLKWLTEEDLFSIGVTALGPRKKIVHALSELRKSYSCAAERHMGHPSHGNGSAKSSRAKTQTEISNFIDDETTKPAANKLITDFFPGLVSDRKKVCTPPRGQHISSKSHSDPGRRRVQTNHVKNGKLKDIPAWCCIPGTPFRVDAFKYLRGDCSHWFLTHFHMDHYQGLTRSFRHGKIYCSSITAQLVNVKLGIPWEKLQVLPLNQKINIAGIEITCLDANHCPGSIMILFVPPNGKAVLHTGDFRFCEEMASMSLWHACPIHTLILDTTYCNPQYDFPKQEAVIQFVIEAIQAEAFNPKTLFLIGSYTIGKERLFLEVARVLRRKVYITAAKFRLLDCLGFSEEDMRWFTLNEQESQIHVVPMWTLASFKRLKHISNQYAGRFSLIVAFSPTGWALGKGKKKAPGRRWQQGTIIRYEVPYSEHCSFTELKEFVKILSPENIIPSVNNDGPDSTKAMISLLLP |